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Abstract 

One of the most famous results in the theory of random graphs estabhshes that the threshold 
for Hamiltonicity in the Erdos-Renyi random graph Gn,p is around p ^ log '°s " , Much re- 
search has been done to extend this to increasingly challenging random structures. In particular, 
a recent result by Frieze determined the asymptotic threshold for a loose Hamilton cycle in the 
random 3-uniform hypergraph by connecting 3-uniform hypergraphs to edge-colored graphs. 

In this work, we consider that setting of edge-colored graphs, and prove a result which 
achieves the best possible first order constant. Specifically, when the edges of Gn,p are randomly 
colored from a set of (1 -I- o(l))n colors, with p — Il+£iili]2i^ ^ TffQ show that one can almost 
always find a Hamilton cycle which has the further property that all edges are distinctly colored 
(rainbow). 

1 Introduction 

Hamilton cycles occupy a position of central importance in graph theory, and are the subject of 
countless results. In the context of random structures, much research has been done on many 
aspects of Hamiltonicity, in a variety of random structures. See, e.g., any of [3l [H O [181 l23] 
concerning Erdos-Renyi random graphs and random regular graphs, any of [6llI4 t [T9 l l20j regarding 
directed graphs, or any of the recent developments [9l \TU[ [TTl [T2] on uniform hypergraphs. In this 
paper we consider the existence of rainbow Hamilton cycles in edge-colored graphs. (A set S of 
edges is called rainbow if each edge of S has a different color.) There are two general types of results 
in this area: existence whp0 under random coloring and guaranteed existence under adversarial 
coloring. 

When considering adversarial (worst-case) coloring, the guaranteed existence of a rainbow struc- 
ture is called an Anti-Ramsey property. Erdos, Nesetfil, and Rodl [13j, Hahn and Thomassen [17] 
and Albert, Frieze, and Reed [1] (correction in Rue [21]) considered colorings of the edges of the 
complete graph Kn where no color is used more than k times. It was shown in [1] that if k < n/64, 
then there must be a multi-colored Hamilton cycle. Cooper and Frieze [7] proved a random graph 
threshold for this property to hold in almost every graph in the space studied. 

There is also a history of work on random coloring (see, e.g., any of [71 [51 [151 [E])) and it has 
recently become apparent that this random setting may be of substantial utility. Indeed, a result 
of Janson and Wormald [16] on rainbow Hamilton cycles in randomly edge-colored random regular 
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graphs played a central role in the recent determination of the threshold for loose Hamiltonicity in 
random 3-uniform hypergraphs by Frieze [lOj . Roughly speaking, a hyperedge (triple of vertices) 
can be encoded by an ordinary edge (pair of vertices), together with a color. Hence, a random 
3-uniform hypergraph gives rise to a randomly edge-colored random graph. We will discuss this 
further in Section [21 when we use the reverse connection to realize one part of our new result. 

Let us now focus on the random coloring situation, where we consider the following model. Let 
Gn,p,K denote a randomly colored random graph, constructed on the vertex set [n] by taking each 
edge independently with probability p, and then independently coloring it with a random color from 
the set [k\. We are interested in conditions on n,p,K which imply that Gn,p,K contains a rainbow 
Hamilton cycle whp. The starting point for our present work is the following theorem of Cooper 
and Frieze. 

Theorem. (See fE/, Theorem 1.1.) There exist constants Ki and K2 such that if p > and 
K > K2n, then Gn,p,K contains a rainbow Hamilton cycle whp. 

The aim of this paper is to substantially strengthen the above result by proving: 

Theorem 1.1. 

(a) There exists a constant K such that if p > ^ " , then for even n, Gn,p,n contains a rainbow 

Hamilton cycle whp. 

(b) If p = '^''T'd K = (1 + 9)n, where e,6 > -^===, then Gn,p,K contains a rainbow 

Hamilton cycle whp. 

To discuss the tightness of our main theorem, let us recall the threshold for Hamiltonicity in 
Gn,p, established by Komlos and Szemeredi pj]. We find that we must have p > "■+iog iogn+aj(n) 
with uj{n) — )• 00, or else the underlying uncolored Gn,p will not even be Hamiltonian. We also need 
at least n colors to appear on the edges in order to have enough colors for a rainbow Hamilton cycle. 
Note that the earlier result came within a constant factor of both of these minimum requirements, 
while part (a) above achieves the absolute best possible constraint on the number of colors, while 
still staying within a constant factor of the minimally required number of edges (albeit only for 
even n). 

Part (b) drives both constants down to be best possible up to first order, and for all values 
of n, regardless of parity. We permit our error terms e and 9 to decrease slowly, although we do 
not expect our constraints on them to be optimal. Our discussion above shows that the trivial 
lower bound for e is around Then, if p ~ "^^j need at least n + r2(n^/^) colors just to 

ensure that whp at least n distinct colors occur on the m ~ ^nlogn edges in the graph; hence, the 
trivial lower bound for 9 is around We leave further exploration to future work, and highlight 
a potential answer in our conclusion. 

This paper is organized as follows. We begin by establishing part (a) of Theorem 11.11 in the 
next section. Section 3 outlines the proof of part (b), which is the main contribution of this 
paper. The proofs of the main steps follow in the section thereafter. We conclude in Section 
5 with some remarks and open problems. The following (standard) asymptotic notation will be 
utilized extensively. For two functions /(n) and g{n), we write f{n) = o{g{n)), g{n) = uj{f{n)), 
or /(n) <C g{n) if lim„^oo f{n)/g{n) = 0, and /(n) = 0{g{n)) or g{n) = Q(/(n)) if there exists a 
constant M such that |/(n)| < M\g{n)\ for all sufficiently large n. We also write f{n) = Q{g{n)) 
if both f(n) = 0{g{n)) and /(n) = 0(g(n)) are satisfied. All logarithms will be in base e 2.718. 
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2 Colored graphs and 3-uniform hypergraphs 



In this section we prove part (a) of our main theorem, and demonstrate the connection between 
rainbow Hamilton cycles in graphs and loose Hamilton cycles in 3-uniform hypergraphs. Indeed, this 
will allow us to realize part (a) as essentially a reformulation of the following recent result of Frieze 
|10j . Let -ffn,p;3 denote the random 3-uniform hypergraph where each potential hyperedge appears 
independently with probability p. In this object, a loose Hamilton cycle is a permutation of the 
vertices (ui, . . . ,Vn) such that {vi,V2-,v-i}, {^^3,1^4,^5}, • • • , {vn-i,Vn,vi} all appear as hyperedges 
(n must be even). 

Theorem 2.1. (See JJUl, Theorem 1.) There is an absolute constant K such that ifp > ^^"i"' , then 
for all n divisible by 4, the random hypergraph Hn^p-s on the vertex set {xi, . . . ,Xn/2,yi, ■ ■ ■ ,yn/2} 
contains a loose Hamilton cycle whp. Furthermore, one can find such a cycle of the special form 
ixa{i),yT{i),Xa{2),yr(2), ■ ■ ■ , Xa{n/2) , yT{n/2)) , for somc pcrmutations a,T G Sn/2- 

As mentioned in the introduction, this theorem was proven by connecting loose Hamilton cycles 
in random 3-uniform hypergraphs with rainbow Hamilton cycles in randomly edge-colored random 
graphs, and applying a result of Janson and Wormald [16]. We will use the reverse connection to 
demonstrate the equivalence between Theorem II. iT a) and Theorem 12. 1[ 

Proof of Theorem 11.11 (a) . Let K be the constant in Theorem 12.11 We are given an even integer 
n, and a graph G ~ Gn^p with p > on vertex set {vi, . . . ,Vn}, whose edges are randomly 

colored from the set {ci, . . . , c„}. Construct an auxiliary 3-uniform hypergraph H with vertex set 
{vi, . . . Vn, ci, . . . , Cn}, by taking the hyperedge {vi, Vj, c^} whenever the edge ViVj appears in G, with 
color Cfc. Note that every such hyperedge appears independently with probability ^ > ^^^f"' , since 
ViVj appears in G with probability p, and receives color Ck with probability ^. Therefore, Theorem 
12. II implies that H has a loose Hamilton cycle whp, of the form {va-(i),CT-(i),Va-(2) 1 Ct-{2), • • • , Ct-(„/2))- 
This corresponds to the Hamilton cycle (f ^(i) , • • • , v„(^n/2) ) iii G, with edges colored Ct-(i) , . . . , Ct-(„/2) ■, 
hence rainbow. □ 



3 Proof of Theorem lUJ(b): high level description 

Let e,9 > ^^^^^^^ ^ be given. We will implicitly assume throughout (when convenient) that they 
are sufficiently small. Our proof proceeds in three phases, so our parameters come in threes. Let 
us arbitrarily partition the k = (1 + 9)n colors into three disjoint groups Ci U C2 U C3, with sizes 

\Ci\=ein, IC2I = (l + ^2)n, \G3\=93n. 

We will analyze the random edge generation in three stages, so we define the probabilities 

_eilogn _(l + e2)logn _e3logn 

2n ' 2n ' 2n 

The e's and 0's are defined by the relations 

ei= £2 = 63 = 1, 01 =e3 = min|^,||, 02 = 6-01-93. (1) 
(We would have taken 0i = ^2 = ^3 = f , except that Lemma HTOl requires + 6*3 < y.) 



3 



3.1 Underlying digraph model 

It is more convenient for our entire argument to work with directed graphs, as this will allow 
us to conserve independence. Recall that Dn,p is the model where each of the n(n — 1) possible 
directed edges appears independently with probability p. We generate a random colored undirected 
graph via the following procedure. First, we independently generate three digraphs = Dn,pi, 
D2 = Dn^p^, and = Dn,ps, and color all of the directed edges from the full set of colors. 

We next use the D° to construct a colored undirected graph G, by taking the undirected edge 
uv if and only if at least one of uti or vti appear among the D°. The colors of the undirected edges 
are inherited from the colors of the directed edges, in the priority order D^, D2, D^. Specifically, 
if or appear already in £)° , then uv takes the color used in D° even ifuiovvi, appear again 
in Z?3, say. In the event that both and appear in D^, the color of is used for uv with 
probability 1/2, and the color of is used otherwise. Similarly, if neither of ui nor trh appear in 
Dl, but ui appears in both D2 and D^, the color used in D2 takes precedence. It is clear that the 
resulting colored graph G has the same distribution as Gn^p^nj with 

p = 1 - (1 - p,)\l - p2)\l - psf = {! + € + 0(e'))^. 

3.2 Pcirtitioning by color 

In each of our three phases, we will use one group of edges and one group of colors. Since each D° 
contains edges colored from the entire set Ci U C2 U C3, for each i we define Di C D° to be the 
spanning subgraph consisting of all directed edges whose color is in Cj. 

Our final undirected graph is generated by superimposing directed graphs and disregarding the 
directions. Consequently, we do not need to honor the directions when building Hamilton cycles. 
To account for this, we define three corresponding colored undirected graphs Gi, G2, and G3. These 
will be edge-disjoint, respecting priority. 

The first, Gi, is constructed as follows. For each pair of vertices u, v with ui & Di but vtt Z?J, 
place uv in Gi in the same color as n^. If both ui and vi, are in Di, we still place the edge uv in 
Gi, but randomly select either the color of ui or of vi,. However, ii u^ E Di but vi, E Dl\ Di, 
then uv is only placed in Gi with probability 1/2; if it is placed, it inherits the color of ui. Note 
that this construction precisely captures all undirected edges arising from D^, using colors in Ci. 

We are less careful with G2, as our argument can afford to discard all edges that arise from 
multiply covered pairs. Specifically, we place uv G G2 if and only iiui E D2\Dl and ir^ U • 
As the pair {u, v} is now spanned by only one directed edge in D^, the undirected edge uv inherits 
that unique color. We define G3 similarly, placing uv G G3 if and only if G £)3 \ {Dl U D2) and 
D^UD^UDl 

In this way, we create three edge-disjoint graphs Gj. By our observations in the previous section, 
we may now focus on finding an (undirected) rainbow Hamilton cycle in Gi U G2 U G3. Importantly, 
note that in terms of generating colored undirected edges, the digraph Dl has higher "priority" 
than D2 or D^. So, for example, the generation of Gi is not affected by the presence or absence of 
edges from D2 or D^. 



4 



3.3 Main steps 

We generally prefer to work with Gi and Di instead of D° because we are guaranteed that the edge 
colors lie in the corresponding d. This allows us to build rainbow segments in separate stages, 
without worrying that we use the same color twice. Let df{v) denote the out-degree of v in Di. 
We now define a set S of vertices that need special treatment. We first let 

'S'o = 'S'o,! U 5*0,2 U 5o,3, 

where 

5o,i = {^:4M<^logn| (2) 
'S'0,2 = : dtiv) < ^ log n| (3) 

5o,3 = {^:4M<^log^}- (4) 

Also, define 7 = min{|, ^ei9i, ^£3^3}, and note that the constraints on e,9 in Theorem 11.11 imply 
the bound 

1 1 1 / 100 \^ 1 



^ 4 3 12 V Vlog log n ) ^ log log n ^ 
Lemma 3.1. The set Sq satisfies \S()\ < ^n^^^ whp. 

The vertices in Sq are delicate because they have low degree. We also need to deal with vertices 
having several neighbors in S'o. For this, we define a sequence of sets S'o, Si, . . . , Sf in the following 
way. Having chosen St, if there is still a vertex v ^ St with at least 4 out-neighbors in Sf (in any of 
the graphs Di, D2, or D3), we let St+i = StU {v} and continue. Otherwise we stop at some value 
t = T and take S = St- 

Lemma 3.2. With probability 1 — o{n~^), the set S contains at most n^~'^ vertices. 

To take care of the dangerous vertices in S, we find a collection of vertex disjoint paths 
Qi,Q2, ■ ■ ■ ,Qs, s = \S\ such that (i) each path uses undirected edges in G2, (ii) all colors which 
appear on these edges are distinct, (iii) all interior vertices of the paths are vertices of S, (iv) every 
vertex of S appears in this way, and (v) the endpoints of the paths are not in S. Let us say that 
these paths cover S. 

Lemma 3.3. The graph G2 contains a collection Qi, Q2, . . . ,Qs of paths that cover S whp. 

The next step of our proof uses a random greedy algorithm to find a rainbow path of length 
close to n, avoiding all of the previously constructed Qi. 

Lemma 3.4. The graph G2 contains a rainbow path P of length n' = n— sf^^ whp. Furthermore, 
P is entirely disjoint from all of the Qi, and all colors used in P and the Qi are distinct and from 
G2. 
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Let U be the vertices outside P. (Note that U contains all of the paths Qi.) In order to link 
the vertices of U into P, we split P into short segments, and use the edges of G3 to splice U into 
the system of segments. We will later use the edges of Gi to link the segments back together into 
a rainbow Hamilton cycle, so care must be taken to conserve independence. The following lemma 
merges all vertices of U into the collection of segments, and prepares us for the final stage of the 
proof. Here, d^{v; A) denotes the number of Di-edges from f to a set A. Let 

f 40 7 1 

L = max 1 15 • e^afa > ^ | > (6) 
and note that our conditions on e,9 in Theorem II. H together with ([1]), imply that £3^3 > \ ' 

L < max {15 • 1°^'°^", 7 • ^^^^^^} < ^1^- (7) 

Lemma 3.5. With probability 1 — o(l), the entire vertex set can be partitioned into segments 
Ii,...,Ir, with r < j^, such that the edges which appear in the segments all use different colors 
from C2 U C3. The segment endpoints are further partitioned into AU B, with each segment having 
one endpoint in A and one in B, such that every a ^ A has d^{a;B) > ^^j^^ogn, and every 
be B has d+{b; A) > ^ logn. All of the numeric values d^ (a; B) andd^{b;A) have already been 
revealed, but the locations of the corresponding edges are still independent and uniform over B and 
A, respectively. 

The final step links together the segments Ii, . . . ,Ir using distinctly-colored edges from Gi. For 
this, we create an auxiliary colored directed graph T, which has one vertex Wk for each interval Ik- 
There is a directed edge Wjwl E F if there is an edge e £ Gi between the -B-endpoint of Ij and the 
j4-endpoint of Ik', it inherits the color of e. Since all colors of edges in F are from Ci, it therefore 
suffices to find a rainbow Hamilton directed cycle in F. We will find this by connecting F with a 
well-studied random directed graph model. 

Definition 3.6. The d-in, d-out random directed graph model D(i-in,d-out is defined as follows. 
Each vertex independently chooses d out-neighbors and d in-neighbors uniformly at random, and all 
resulting directed edges are placed in the graph. Due to independence, it is possible that a vertex u 
selects V as an out-neighbor, and v also selects u as an in-neighbor. In that case, instead of placing 
two repeated edges ui, place only one. 

Instead of proving Hamiltonicity from scratch, we apply the following theorem of Cooper and 
Frieze. 

Theorem 3.7. (See 16^, Theorem 1.) The random graph D2-in,2-out contains a directed Hamilton 
cycle whp. 

This result does not take colors into account, however. Fortunately, in equation ([6]), we define 
L to be large enough to allow us to select a subset of Gi-edges which is itself already rainbow. The 
analysis of this procedure is the heart of the proof of the final step. 

Lemma 3.8. The colored directed graph F contains a rainbow directed Hamilton cycle whp. 
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Since each directed edge of F corresponds to an undirected Gi-edge from a i?-endpoint of an 
interval to an A-endpoint of another interval, a directed Hamilton cycle in F corresponds to a 
Hamilton cycle linking all of the intervals together. Lemma 13.81 establishes that it is possible to 
choose these linking edges as a rainbow set from Ci. The edges within the intervals were themselves 
colored from (72 U C3, so the result is indeed a rainbow Hamilton cycle in the original graph, as 
desired. 



4 Proofs of intermediate lemmas 

In the remainder of this paper, we prove the lemmas stated in the previous section. Although the 
first lemma is fairly standard, we provide all details, and use the opportunity to formally state 
several other well-known results which we apply again later. 



4.1 Proof of Lemma 13.11 

Our first lemma controls the number of vertices whose degrees in Di are too small. Recall from 
Section [3TT] that the D° are independently generated. Their edges are then independently colored, 
and the edges of D° which receive colors from Ci are collected into Di. (Priorities only take effect 
when we form the Gi in Section 13. 2[ ) Therefore, the out-degrees df{v) of vertices v in Di are 
distributed as 

d, (v) r. Bm i^n .^g^^g^^g^ J > Bm ^0.99n, J 

, , , / '^ + 02 \ ( 0.49 log n\ 

4(„) ~ Bm („ - 1,K . ^^,^^;^^J > Bin (o.9!.„, 2i?!2fii5l!l 



Thus the expected size of Sq satisfies 

IE[|5o|] <n(pi+P2+P3), 



where 



Pi 

P3 



Bin 0.99n. 



0.4961^1 log n\ ei^ilogn 



n 



< 



20 



Bin 0.99n. 



0.49 log n\ logn 



, < 

n y - 20 
Bin(0.99n.^i^^i^^^^<^^^^^°S" 



n 



20 



We will repeatedly use the following case of the Chernoff lower tail bound, which we prove with 
an appropriate explicit constant. 
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Lemma 4.1. The following holds for all sufficiently large mq, where m is a positive integer and 
< q < 1 is a real number. 



Proof. Calculation yields 



Bin (m, q) < -mq 
9 



Bin (m, q) < —mq 
9 



mq/9 , s 



< e 



-0.533mg 



•mq/9 

<E 

k=l 



^m-k ^ ^ (emq\ 



\ k J 



The function {^^^ = exp {^(log C — log fc)} is increasing in k in the range < A: < eC. Thus 



Bin (m, g) < —mq 



9 \mq/9) 



mq 



e 9 



mq 



gmq{|log9e-|+o(l)) 



< e 



-0.533m5 



as claimed. 



□ 



Returning to the proof of Lemma 13. we observe that since ^ < ^ " 0.99 • 0.49, a direct 
application of Lemma 14.11 now gives 

/ 491oen\ 1 

P2 < IP Bin 0.99n, — < - • 0.99 • 0.49 log n 

L V n / 9 

-0.533-0.99-0.49 logn 



< e 

< n 



-0.258 



A similar argument establishes that pi < n O-SSSei^i ^^^^ <c n 0-258e3^'3^ This proves that E [\So\] 



Gin 



1-7- 



, where we recall our definition 7 = minji, \e2O2, j^sO^]. We complete the proof of the 
lemma by showing that |S'o| is concentrated around its mean. For this, we use the Hoeffding-Azuma 
martingale tail inequality applied to the vertex exposure martingale (see, e.g., [2]). Recall that a 
martingale is a sequence Xq, Xi, . . . of random variables such that each conditional expectation 
E [Xt+i \ Xq, . . . ,Xt] is precisely Xf. 

Theorem 4.2. Let Xq, . . . ,Xn be a martingale, with bounded differences — Xi\ < C. Then 

for any A > 0, 

f 

P [Xn >Xo + X] < exp 



2C72 



n 



Here we consider | | to be a function of Yi,Y2, . . . ,Yn where denotes the set of edges 
jt:, kj G Dl U D2 U D3, j < k. The sequence = E [jSol | 1^, . . . , If] is called the vertex-exposure 
martingale. There is a slight problem in that the worst-case Lipschitz value for changing a single 
Yk can be too large, while the average case is good. There are various ways of dealing with this. 
We will make a small change in D° = U D2 U D'^. Let D° be obtained from D° by reducing 
every degree below 5 log n. We do this in vertex order u = 1, 2, . . . , n and delete edges incident with 
V in descending numerical order. We can show that this usually has no effect on D° . 
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Lemma 4.3. With probability 1 — o{n every vertex in Gn,p with p < ^-^^"S"- has degree at most 
5 log n. 

Proof. The probability that a single vertex has degree at least 5 log n is 

5 log n 



' [Bin [n — 1, p) > 5 log n] < 



n 



5 log n 



1 . 1 log n 



n 



< 



en 1.1 log n 



5 log n n 

l.le^ 51osn 



5 log n 



n 



5 

-2.57 



SO a union bound over all vertices gives the result. 



Therefore, 



D° = D° 



□ 



1 — o{n and so if we let Z = j^o] be the size of the corresponding 



set evaluated in D°, we obtain E 



Z 



E [\So\] + o(l) = o(n^ '^). Furthermore, changing a 1^ can 



only change Z by at most 15 log n. So, we have 



Zi>E 



H — n 



1-7 



< exp 



n 



2-27/16 



2(15 log n)2n 



<o(n ^), 



completing the proof of Lemma 13.11 



□ 



4.2 Proof of Lemma [3721 

We use the following standard estimate to control the densities of small sets. 



Lemma 4.4. With probability 1 — o{n in Dn,p with p < every set S of fewer than ■ j^^t-^ 
vertices satisfies e{S) < 2\S\. Here, e{S) is the number of directed edges spanned by S. 



Proof. Fix a positive integer s < 



71 ■ ■ 



and consider sets of size s. We may assume that s > 2, 



because a single vertex cannot induce any edges. The expected number of sets S with jS"! = s and 
e{S) > 2s is at most 



log n 



n 



2s 



es 
~2^ 



2\ 2s 



logn 



n 



2s 



e^ log n 
4n 

It remains to show that when this bound is summed over all 2 < s < 4- • , "> , the result is still 

— e* log'' n ' 

o(n^^). Indeed, for each 2 < s < 21ogn, the bound is at most O ( '"^^ " ) , so the total contribution 



from that part is only O ( '"^^ " 
bound is at most 



) = o(n ^). On the other hand, for each 2 logn < s < 



75 



log^ n ' 



the 



n 



log n 



e^ log2 n 
An 



2 log n 



2 log n 



n 
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Thus the total contribution from 21o2;n < s < 4- • , is at most o(n as desired. □ 

" e* log n \ n 

We are now ready to bound the size of the set S which was created by repeatedly absorbing 
vertices with many neighbors in Sq. 

Proof of Lemma 13.21 We actually prove a stronger statement, which we will need for Lemma 
14.71 Suppose we have an initial Sq satisfying |5o| < ^n^~'^, as ensured by Lemma [3?n Consider 
a sequence Sq, S'l, S'2, ■ ■ ■ where S'^^i is obtained from S'^ by adding a vertex v ^ S[ for which 
df{v] S[) > 3 for some i. Note that when this process stops, the final set S' will contain the set S 
which our definition obtained by adding vertices with degree at least 4 into previous St- 

So, suppose for contradiction that this process continues for so long that some reaches 
n^~'^ = Note that t > ^n^~'^ . Since each step introduces at least 3 edges, we must have 

e(<S't) > 3t > 2v}~^ = 2\St\. Yet by construction, Z^i U -D2 U is an instance of Dn^q for some 
q < so this contradicts Lemma |4.4[ □ 

4.3 Proof of Lemma [3.31 

In this section, we show that for each vertex u G 5, we can find a disjoint G2-path Q 3 v which 
starts and ends outside S. We also need all colors appearing on these edges to be different. Since 
we are working in a regime where degrees can be very small, we need to accommodate the most 
delicate vertices first. Specifically, let 5*0,0 be the set of all vertices with d2{v) < ^logn, where 
d2{v) is the degree of u in G2. Although 5o,o will typically not be entirely contained within 5o,2, 
we can show that it is still usually quite small. 

Lemma 4.5. We have \So,o\ < n^-^^ whp. 

Proof. By construction, G2 ~ Gn,q2, where 

because the first factor is the probability that exactly one oi ui or vi appears in D2, the second 
factor is the probability that it receives a color from C2 , and the third factor is the probability that 
neither ui nor vti appear in V^. Hence for a fixed vertex v, its relevant degree in G2 is distributed 
as 

0.99 log 

n 



d2iv) ~ Bin (n - 1, ^2) > Bin ( 0.99re, 
Since < h ■ 0.99 • 0.99, Lemma SJ] implies that 



d2{v) < ^logn 



< r 



0.99 log n\ 1 
Bin 0.99n, — ] <-■ 0.99 • 0.99 log n 



n J - 9 

0.533-0.99-0.99 logn 



< (9) 
Therefore, E [|S'o,o|] < n • n~^'^'^^, and Markov's inequality yields the desired result. □ 

We have shown that vertices of 6*0,0 are few in number. Our next result shows that they are 
also scattered far apart. This will help us when we construct the covering paths, by preventing 
paths from colliding. 
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Lemma 4.6. Let dist2{v,w) denote the distance between v and w in G2. Then, whp, every pair 
v,w £ Sofi satisfies dist2{v,w) > 5. 

Proof. Recall that G2 ~ Gn,q2 with q2 defined as in ([8]). Consider a fixed pair of vertices v, w. For 
a fixed sequence of /c < 4 intermediate vertices xi,X2, ■ ■ ■ ^Xf^, let us bound the probability q that 
V, w both have ^2 < ^ logn, and all the edges uxi, X1X2, 2:2X3, . . . , xj^w appear in G2. First expose 
the edges vxi, X1X2, ■ ■ ■ , XkW, and then expose the edges between v and [n] \ {v,xi, . . . ,Xk,w}, and 
between w and that set. 

This gives the following bound on our probability q: 



< 



1.01 log n 



n 



k+l 



. 0.99 log n\ logn 
Bm n — 2 — /c, < 



n / ~ 10 



2 



(10) 



A calculation analogous to ([9]) bounds the Binomial probability by n'^'^"^, so taking a union bound 
over all 0{n^) choices for the Xj, for all < A; < 4, we find that for fixed v,w, the probability that 
dist2(f,t«) < 5 is at most 

^0(n^') . rnii^y^' . (,-0.52)2 ^ ^-2.04+0(1). 

fc=o V / 

Therefore, a final union bound over the O(n^) choices for v,w completes the proof. □ 

The previous result will help us cover vertices in Sq^q with G2-paths. However, the objective of 
this section is to cover all vertices of S. Although the analogue of Lemma 14.61 does not hold for 5, 
it is still possible to prove that S is sparsely connected to the rest of the graph. Recall from (0) 
that7 = min{i,iei0i,|e3^3} > i^^^- 

Lemma 4.7. With respect to edges of G2, every vertex v is adjacent to at most ^ vertices in S 
whp. (This applies whether or not v itself is in S.) 

Proof. Fix a vertex v. Let S' be the set obtained by constructing the analogous sequences to 
5g ^, 5q 2, -Sg 3, 5^, . . . on the graph induced by [n] \ {v}, where S'q ^ are defined as in ©-(jH), but 
the S[j^i are obtained by adding vertices with at least 3 (not 4) Dj-out-neighbors in S[. Clearly, S' 
contains S \ }, because the effect of ignoring v is compensated for by using 3 instead of 4. The 
advantage of using S' instead of S is that S' can be generated without exposing any edges incident 
to V. As we will take a final union bound over the n choices of v, it therefore suffices to show that 
with probability 1 — o(n~^), the particular vertex v has at most ^ neighbors in 5'. 

For this, we expose all edges of 01^02^0"^ that are spanned by [ra] Recall that our proof 

of Lemma l3.2l alreadv absorbed vertices with 3 out-neighbors (instead of 4), so we have |5'| < n^~^ 
with probability 1 — o(n^^). It remains to control the number of edges between v and S' , so we 
now expose all edges of D° U D2 U D'^. The G2-edges there appear independently with probability 
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q2 as defined in (l8|), so the probability that at least ^ edges appear is at most 



7. 



< 



< 



< 



7 

/ 1.01 log n\^/'^ 



2/7 y V ™ 

e • ■n}~'^ 1.01 log 



n 



2/7 n 



2/7 



27 log n 



2/7 



^ (21ogn)2i°s'°s" _i 

< ;^2 = '^('^ )• (11) 

Taking a final union bound over all initial choices for v completes the proof. □ 

We will cover each vertex v £ S with a G2-path by joining two G2-paths of length up to to 
2, each originating from v. It is therefore convenient to extend the previous result by one further 
iteration. 

Corollary 4.8. With respect to edges of G2, every vertex v is within distance two of at most (^)^ 
vertices in S whp. (This applies whether or not v itself is in S.) 

Proof. Fix a vertex v. Construct S' in the same way as in the proof of Lemma 14.71 exposing 
only edges spanned by [n] \ {v}. Using only those exposed edges, let T C [n] \ {v} be the set of all 
vertices in S' or adjacent to S' via edges from G2. By Lemma [4.3|. the maximum degree of G2 \ {v} 
is at most 51ogn with probability 1 — o(n~^), so |r| < n^^'^ ■ 51ogn. A similar calculation to (jlip 
then shows that with probability 1 — o(n~^), v has at most ^ neighbors in T. 

Taking a union bound over all v, and combining this with Lemma 14.71 we conclude that whp, 
every vertex has at most ^ neighbors in SUN{S), and each of them has at most ^ neighbors in S. 
This implies the result. □ 

We are now ready to start covering the vertices of 5 with disjoint rainbow G2-paths. The most 
delicate vertices are those in 5o,0) because by definition all other vertices already have G2-degree 
at least ^ logn. Naturally, we take care of 5o,o first. 

Lemma 4.9. The colored graph G2 contains a rainbow collection Qi,Q2, ■ ■ ■ , Qs' of disjoint paths 
that cover S D 5o.o whp. 

Proof. We condition on the high-probability events in Lemmas 14.51 14.61 and 14.71 and use a greedy 
algorithm to cover each v £ SCi 5*0,0 with a path of length 2, 3, or 4. Recall that G2 ~ Gn,q2, where 
q2 was specified in ([8]). We can bound q2 by 

, log n ( log n\ , „ „ X / log n ^ ^ 

92 > 1 + £2 ^ 1 - ^ • 1 - 01 - 03 . 1 - — 

n \ n \ n 



Since equation ([T]) ensures that 0i + ^3 < s-'^d our conditions on Theorem 11.11 force £2 > 
i • ^> °f the minimum degree in G2 is at least two whp. Condition on this as well. 

3 Vlog log n logn ^ to ^ f 

Now consider a vertex v £ S (1 Sqa, and let xi and X2 be two of its neighbors. If both Xi are 
already outside S, then we use xivx2 to cover v. Otherwise, suppose that xi is still in S. Since we 
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conditioned on vertices in Sq^ being separated by distances of at least 5 (Lemma 14. 6p . xi cannot be 
in 5o,0) so it has at least ^logn G2-neiglibors. These cannot all be in S, because we conditioned 
on the fact that every vertex has fewer than ^ < 2 log log n neighbors in S (Lemma 14. 7p . So, we 
can pick one, say yi, such that yixiv is a path from outside S to v. A similar argument allows 
us to continue the path from v to a vertex outside S in at most two steps. Therefore, there is a 
collection of paths of length 2-4 covering each vertex in 5 H 5*0,0 • They are all disjoint, since we 
conditioned on vertices of 5o,o being separated by distances of at least 5. 

At this point, we have exposed all G2-edges spanned by S and its neighbors, but the only thing 
we have revealed about their colors is that they are all in C2. Now expose the precise colors on 
all edges of these paths. Since we conditioned on |5o,o| < nP'^^ (Lemma 14. 5p . the total number of 
edges involved is at most 4 • ttP-"^^ < n^-"^^. The number of colors in C2 is {1 + 92)n, so by a simple 
union bound the probability that some pair of edges receives the same color in C2 is at most 

[ 2 J(TT^ = °(^)- 

Therefore, the covering paths form a rainbow set whp, as desired. □ 

We have now covered the most dangerous vertices of S. The remainder of this section provides 
our argument which covers all other vertices in S. 

Proof of Lemma 13. 3L Condition on the high-probability events of Lemmas 13. 2( 14.71 14. 9( and 

Corollary 1321 We have already covered all vertices in S'n 6*0,0 with disjoint rainbow paths of lengths 
up to four (Lemma 14. 9|) . We cover the rest of the vertices in S\Sofi with paths of length two, using 
a simple iterative greedy algorithm. Indeed, suppose that we are to cover a given vertex v £ S\Sofi. 
Since it is not in 5*0,0, it has G2-degree at least ^logn, and at most ^ of these neighbors can be 
within 5 (Lemma 14. 7p . 

Furthermore, we can show that at most 2(^)^ of f's neighbors outside 5 can already have 
been used by covering paths. Indeed, for each neighbor w ^ S of v which was used by a previous 
covering path, we could identify a vertex x E 5 adjacent to w which was part of that covering path. 
Importantly, x is within distance two of v, so the collection of all x obtainable in this way is of 
size at most (^)^, as we conditioned on Corollarv 14.81 Since every covering path uses exactly two 
vertices outside 5, the total number of such w is at most 2(^)^. Putting everything together, we 
conclude that the number of usable G2-edges emanating from v is at least 

1 , 2 /2V 1 , 

— logn 2 - >— logn. 

10 7 \7/ 11 

Expose the colors (necessarily from C2) which appear on these G2-edges. Of the total of (1 + 62)n 
available, we only need to avoid at most 4|5| which have already been used on previous covering 
paths. Since we conditioned on |5| < n^~'^ (Lemma 13. 2p . this is at most An^~^ colors to avoid. We 
only need to have two new colors to appear among this collection in order to add a new rainbow 
path of length two covering v. Taking another union bound, we find that the probability that at 
most one new color appears is at most 
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Here, the first factor of (1 + 02)n corresponds to the number of ways to choose the new color to add 
(or none at all). Since we only run our algorithm for o(n) iterations (once per vertex in S \ Sqa), 
we conclude that whp we can cover all vertices of S with disjoint rainbow G2-paths. □ 

4.4 Proof of Lemma 13.41 

In this section, we construct a rainbow G2-path which contains most of the vertices of the graph, 
but avoids all covering paths from the previous section. In order to carefully track the independence 
and exposure of edges, recall from Section 13.21 that G2 is deterministically constructed from the 
random directed graphs D°, D2, and D^. Let us consider the generation of the D° to be as follows. 
The probability that the directed edge mi appears in Di is pi ■ j^rg^-f^^qr^^ , so we expose each 

Z?! -out-degree d~l{v) by independently sampling from the Bin (ji — l,pi ■ jTpg^rf^jqrg^^ distribution. 
Importantly, we do not reveal the locations of the out-neighbors. Similarly, for D2 and D3, we 
expose all out-degrees d^(v) and d'^^v), each sampled from the appropriate Binomial distribution. 
By Lemma 14.31 df{v) < 51ogn whp; we condition on this. 

Note that from this information, we can later fully generate (say) Di and as follows. At each 
vertex v, we independently choose df{v) out-neighbors uniformly at random. This will determine 
all Di-edges. Next, for every edge which is not part of Di, independently sample it to be part of 
DI \ Di with probability pi (l — T+eT^^^qrg^) • This will determine all edges, and a similar system 
will determine all edges of D2 and D^. 

Returning to the situation where only the df{v) have been exposed, we then construct the So^i 
by collecting all vertices whose df{v) are too small, and build the sequence Sq, Si, S2, ■ ■ ■ , St- In 
each iteration of that process, we go over all vertices which are not yet in the current St- At each 
V, we expose all Dj-edges incident to St- For this section, we will only care about the D2-out-edges 
from V ^ S (initially counted by d^(f )) that are not consumed in this process. Fortunately, at each 
exposure stage, there is a clear distribution on the number of these out-edges that are consumed 
toward St, and this will only affect the number, not the location, of the out-edges which are not 
consumed. 

Therefore, after this procedure terminates, we will have a final set S, and the set of revealed 
(directed) edges is precisely those edges spanned by 5, together with all those between S and 
Vi = [n] \ S. This set of revealed edges is exactly what is required to construct the covering 
paths Qi, . . . ,Qs in Lemma [3T3l Within Vi, the precise locations of the edges are not yet revealed. 
Instead, for each vertex v G Vi, there is now a number d2{v), corresponding to the number of 
Z)2-out-edges from v to vertices outside S. 

We now make two crucial observations. First, the distributions of where these endpoints lie are 
still independent and uniform over Vi. Second, every d2{v) < 51ogn and d2{v) > ^logn — 3 > 
^ log n, because if there were 4 out-edges from v to S, then v should have been absorbed into S 
during the process. 

This abundance of independence makes it easy to analyze a simple method for finding a long 
path, based on a greedy algorithm with backtracking. (This procedure is similar to that used in 
[22J by Fernandez de la Vega.) Indeed, the most straightforward attempt would be to start building 
a path, and at each iteration expose the out-edges of the final endpoint, as well as their colors. If 
there is an option which keeps the path rainbow, we would follow that edge, and repeat. If not, 
then we should backtrack to the latest vertex in the path which still has an option for extension. 
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We formalize this in the following algorithm. Particularly dangerous vertices will be coded by 
the color red (not related to the colors of the edges in the Gn,p,K)- Let V2 C Vi be the set of all 
vertices which are not involved in the covering paths Qi. We will find a long G2-path within V2 
which avoids all of the covering paths. 

Algorithm. 

1. Initially, let all vertices of V2 be uncolored, and select an arbitrary vertex v £ V2 to use as 
the initial path Pq = {v}. Let C/q = V2 \ {v}. This is the set of "untouched" vertices. Let 
i?o = 0. This will count the "red" vertices. 

2. Now suppose we are at time t. \ Ut\ < ^^^^i— , terminate the algorithm. 

3. If the final endpoint v of Pt is not red, then expose the first ^d^iv) of u's D2-out-neighbors. 
If none of them lies in Ut, via an edge color not yet used by Pt or any of the covering paths 
Qi, then color v red, setting Ut+i = Ut, Pt+i = Pt, and Rt+i = RtU {v}. 

Otherwise, arbitrarily choose one of the suitable out-neighbors w £ Ut- Set Ut+i = Ut \ {w}. 
Expose whether mi £ D^, wi G D\, or wt E • none of those three directed edges are 
present, then add w to the path, setting P^+i = U {w} and Rt+i = Rt- Otherwise, color 
both V and w red, and set Pt+i = Pt and Rt+i = RtD {v, w}. 

4. If the final endpoint v of Pt is red, then expose the second ^d2{v) of f 's D2-out-neighbors. 
First suppose that none of them lies in Ut, via an edge color not yet used by Pt or any of 
the covering paths Qi. In this case, find the last vertex v' of Pt which is not red, color it 
red, and make it the new terminus of the path. That is, set Ut+i = Ut, let Pt+i be Pt up 
to v', and set Rt+i = RtU {v'}. If v' did not exist (i.e., all vertices of Pt were already red), 
then instead let v' be an arbitrary vertex of Ut and restart the path, setting Pj+i = {v'}, 
Rt+i = Rt, Ut+i = Ut \ {v'}. 

On the other hand, if v has a suitable out-neighbor w S Ut, then set Ut+i = Ut\ {w}. Expose 
whether mil G D^, wi G or wi £ If none of those three directed edges are present, 
then add w to the path, setting Pt+i = PtU {w} and Rt+i = Rt- Otherwise, color w red, find 
the last vertex v' of Pt which is not red, and follow the remainder of the first paragraph of 
this step. 



The key observation is that the final path Pt contains every non-red vertex which lies in V2 \ Ut- 
Since Lemmas 13.21 and 13.31 imply that IV2I > n — 3\S\ > n — 3n^~^, and we run until Ut < ^ a^j^^ . 
Lemma 13.41 therefore follows from the following bound. 

Lemma 4.10. The final number of red vertices is at most n ■ e~soo whp. 

Proof. The color red is applied in only two situations. The first is when we expose whether any 
of mi £ Dl, wi £ Dl, 01 wi £Dl hold. To expose whether mi £ D^, we reveal whether mi £ Di, 
using the previously exposed value of dl{v), which we already conditioned on being at most 51ogn. 
Since v's Di-out-neighbors are uniform, the probability that mi £ Di is at most (jz^^- If it 

is not in Di, the probability that it is in \ Di is bounded by by the description at the 
beginning of Section 14.41 The analysis for the other two cases are similar, so a union bound gives 
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that the chance that any of G wi £ D^, or wi G Z?2 hold is at most 3 • Note 
that this occurs at most n times, because each instance reduces the size of Ut by 1. Hence the 
expected number of red vertices of this type is at most O(logn), which is of much smaher order 
than ne-®(^^). 

The other situation in which red is applied comes immediately after the failed exposure of some 
k = ^d^iv) > j^logn Z)2-out-neighbors, in either of Steps 3 or 4. Failure means that all k of 
them either fell outside Ut, or had edge colors already used in Pt or some covering path Qi. Step 2 
controls |f/t| > total number of colors used in Pt or any covering path Qi is at most 

n — \Ut\, out of the {l + 92)n available. Further note that because of our order of exposure, there is 
a set T of size at most 31ogn such that u's Z)2-out-neighbors are uniformly distributed over Vi \ T. 
This is because we have exposed whether was a D2-edge, for the predecessor it of f along Pt, 
and we may also have already exposed the first half of v's -D2-out-neighbors in a prior round, which 
could consume up to ^ • 5 log n vertices. Therefore, the chance that a given out-neighbor exposure 
is successful (i.e., lands inside Ut, via one of the > unused colors), is at least 

\Ut\T\ \Ut\ 1 W n 1 



(l + o(l)) ( 1-— -V' " <e-2io^^. 



|Vi\r| {I + e2)n - \2^/l^ {l-o{l))nJ \2^/l^ {1 + 62)71) 5(logn)2/3' 
We conclude that the chance that aW k> log n fail is at most 

5(log 

Since we will not perform this experiment more than twice for each of the n vertices, linearity of 
expectation and Markov's inequality imply that whp, the final total number of red vertices is at 
most n ■ e~ 300 v"°s"^ as desired. □ 

4.5 Proof of Lemma 13.51 

At this point, we have a rainbow G2-path P of length n' > n—-^== , which is disjoint from the paths 
Qi which cover S. Recall from ([6]) and ([7]) that we defined L = max \^15e'^^^^^^^^\ ^} ^ v^Togn. 
Split P into r = segments of length L, as in Figured! If n' is not divisible by L, we may discard 
the remainder of P, because L < ^/Jogn. 

Partition the 2r endpoints into two sets Ai U Bi so that each segment has one endpoint in 
each set, but there are no vertices a £ Ai and b £ Bi which are consecutive along P. By possibly 
discarding the final interval (which will only cost an additional L < ^log n), we may ensure that 
the initial and final endpoints are both in Ai. 

The reason for our unusual partition is as follows. In our construction thus far, we already 
needed to expose the locations of some D^-edges, since they had priority over the I?2- certain 

B^B^ AiAi B^B^ A^ B^ A^ 
• •• •• •— ••• • • 

Figure 1: The long path P, divided into consecutive intervals of length L. Endpoints of successive intervals are 
adjacent via original edges of P. The set of interval endpoints has been partitioned into AiL}B\. Note that endpoints 
which are adjacent via an edge of P are always assigned to the same set. 
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locations, we have revealed that there are no -Df-edges. In particular, between every consecutive 
pair of vertices m, v on the path P, we found a L'2-edge, and confirmed the absence of any Z)^-edges. 

Fortunately, our construction did not expose any D^-edges between non-consecutive vertices of 
the path P. In particular, if we now wished, for any vertex a G vli, we could expose the number 
N of its Di-out-neighbors that lie in Bi; then, the distribution of these out-neighbors would be 
uniform over Bi. This uniformity is crucial, and would not hold, for example, if some vertex of Bi 
were consecutive with a along P. 

The proof of Lemma [33] breaks into the following steps. Recall that df{v; T) denotes the number 
of Di-edges from a vertex f to a subset T of vertices. Say that v is T-good if d'^{v; T) > logn; 
call it T-bad otherwise. 

Step 1. For every vertex v £ P\Bi, expose the value of df{v; Bi). We show that whp, the initial 
and final endpoints of P are i?i-good, and at most n • 6"^^°^" vertices of P are i?i-bad. 

Step 2. Absorb all remaining vertices and covering paths into the system of segments, using G3- 
edges that are aligned with i?i-good vertices. (See Figure [21 ) This removes some segment 
endpoints, while adding other new endpoints. Let A2 UB2 be the new partition of endpoints. 
Crucially, B2 = Bi, while \A2\ = \Ai\ by losing up to s^^^ vertices, and then adding back 
the same number. Importantly, every new vertex in A2 \ Ai will be i?i-good. 

Step 3. The system of segments can be grouped into several blocks of consecutive segments, in 
the sense that between successive segments in the same block, there is an original edge of P. 
(See Figure [3l) Also, the initial and final endpoints of each block are always of type A, and 
are all i?i-good. 

Step 4. For every vertex b E B2, expose the value of d^{b;A2). We show that whp, at most 
n ■ e~^^°^^ vertices of B2 are j42-bad. 

Step 5. For each consecutive pair of segments along the same block (from Step 3) which has either 
an A2-eiidpoint which is Si-bad or a i?2-eiidpoint which is A2-bad, merge them, together with 
a neighboring segment in order to maintain parity between A^s and B's. (See Figure [H) Let 
A^UB^ be the final partition of segment endpoints after the merging. We show that whp, all 
a G A3 have d^(a; B3) > logn, and all b £ B^ have d~l(b; A3) > logn. Furthermore, 
if we were to expose the Di-edges between A3 and B3, then each vertex a G A3 would 
independently sample df{a; B3) uniformly random neighbors in ^3, and similarly for b G B3. 

This will complete the proof because the final number of segments is {A^l < \A2\ = \Ai\ < ^. 
4.5.1 Step 1 

By construction, = (1 — o(l))^. Now consider an arbitrary vertex v G P\Bi. We have only 
exposed the numeric value of dl{v) thus far in our construction, and not where the Di-out-neighbors 
are. So let us now expose the numeric value of df{v; Bi), but again, not precisely where the out- 
endpoints are. As we observed in the beginning of Section 14. 4| we have dl{v) > ^ logn - 3. Our 
work in the previous section consumes up to one -Di-out-edge at each vertex v £ P, when we reveal 
whether mi E in the third step of the algorithm. Therefore, df{v; Bi) stochastically dominates 
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Figure 2: A covering path of S is absorbed into the system of intervals using Gs-edges. Note that the resulting 
endpoint partition still has one A-endpoint and one B-endpoint in every interval. Importantly, the direction of the 
splicing is such that all new endpoints are of type-j4, as indicated by the yl2-vertices. This is why we continue the 
new interval rightward, through to the next B-endpoint. 
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Figure 3: Evolution of block partition during absorption. Each horizontal row represents an original block, within 
which the successive segments have their endpoints connected by edges of the original path P. The vertical gray path 
from X to y represents the absorption of a new vertex into the collection of segments, involving two different blocks. 
This operation cuts the two edges between x, y and their adjacent j42-vertices, and adds back the P-edge marked 
by the asterisk. Afterward, the segments can be re-partitioned into new blocks (see the gray dotted lines), with all 
initial and final endpoints in each block of type->l, and -Bi-good. 



B2B2 



A^ha.d 
/ 



Merged consecutive intervals 



-^2-^2 



Bo 



A2 








B2 


• 








• 



Figure 4: Original edges of P are used to merge consecutive intervals in the same block, so that a bad endpoint 
can be eliminated. 
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Bin (^^^ logn — 4, (1 — o(l))-^^ . Hence we can use Lemma UTT] to bound that the probabihty that 
df{v;Bi) is too small. 



aT (v; Bt) < - • — ■ logn 

1 ^ ' ^ 9 L 20 ^ 



< e 



-0.533- logn 



o(e-wi°s") 



o(e' 



since ei, 6*1 = The expected number of such vertices in P is at most n 

times this probability. Applying Markov's inequality, we conclude that whp, the number of i?i-bad 
vertices in P is at most n • e~^^°^". This also shows that the initial and final endpoints of P are 
Si-good whp. 



4.5.2 Step 2 

At this point, our entire vertex set is partitioned as follows. We have a collection of rainbow intervals 
/i, . . . each of length exactly L. These already consume at least n — 3^^^— vertices. Since we 

discarded the remainder of P, as well as possibly the final interval, we have r > -^(n — -3^==) — 2. 
A separate collection of rainbow paths Qi, . . . ,Qs covers all vertices of S. There are also some 
remaining vertices. In this section, we will use Gs-edges to absorb the latter two classes into the 
rainbow intervals. 

Since we will not use any further G2-edges, but edges from D2 take precedence over those from 
1^3, we also now expose all edges in • Lemma 14.31 ensures that whp, no vertex is incident to 
more than 5 logn edges of Condition on this outcome. Note that by construction, we have not 
exposed the locations of any D3-edges between vertices outside S, although vertices outside S may 
have up to three exposed Z^s-neighbors located in S. 

We now use a simple greedy algorithm to absorb all residual paths and vertices into our collection 
of intervals. In each step, we find a pair of Gs-edges linking either a new Qi or a new vertex to 
two distinct intervals Ix and ly, using two new colors from C3. We will ensure that throughout the 
process, all intervals 1^ used in this way are separated by at least one full interval Iz along P. 

The specific procedure is as follows. Suppose we have already linked in t paths or vertices, and 
are considering the next path or vertex to link in. Suppose it is a path Qi (the vertex case can 
be treated in an analogous way). Let u,v be the endpoints of Qi. We need to find vertices x,y 
in distinct intervals Ix and ly such that (i) according to P, Ix and ly are separated by at least 
one full interval from each other, and from all Iz previously used in this stage, (ii) x and y are 
separated from the endpoints of Ix and ly by at least two edges of P, and (iii) if x' £ Ix is the vertex 
adjacent to x in the direction of the i?i-endpoint of Ix, and y' £ ly is the vertex adjacent to y in 
the direction of the i?i-endpoint of ly, then both x' and y' are i?i-good. We choose the direction 
of the i?i-endpoint because x' and y' will become the new ^2-endpoints of shortened intervals; see 
Figure [2] for an illustration. 

So, let F be the set of vertices in the intervals which fail properties (ii) or (iii). By Step 1, the 
dominant term arises from the endpoints because n • e~^^°s" ^ t>/j^^. ^ X' ®° 1^1 ^ X' ^l^o 
let T be the set of vertices contained in intervals that are at least one full interval away from any 
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intervals which have previously been touched by this algorithm. Since we observed at the beginning 
of this section that the total number of intervals was at least xi''^ ~ a/j^^ ) ~ 2, we have 

n \ . A . 8Ln 8n 
T > - n - - 2 - 3 • (2t) ] L> n- > n , 



since t < 3^^^— and L < -y/Iogn. Let us now find a newly-colored Ga-edge from u to a vertex of 
T\ F. Note that the number of vertices outside T \ F is at most 

We have not yet exposed the specific locations of the L^s-neighbors of u, but only know (since 
u ^ S) that d^{u) > ^l^logn, and up to three of those Ds-out-neighbors lie within S. Consider 
what happens when we expose the location w of one of u's L'a-out-neighbors which is outside S. 
This will produce a useful Gs-edge uw if (i) w lands in T \ (ii) neither mi or w^i appeared in 

or Z?2, (iii) does not appear in Dg, and (iv) the color of the edge is new. 

Let us bound the probability that w fails any of these properties. We may consider (i)-(iii) 
together, since we showed that at most vertices were outside T\ B, and we conditioned on u 
being incident to at most 51ogn edges of After sampling the location of w, we expose whether 
mi or wi appear in D° ; by the same argument as used in Lemma 14.101 the probability of each is 
at most = 0(2;). So, the probability of failing either (i) or (ii) is at most We conditioned 

at the beginning of Section [4.41 on d^{w) < 51ogn, so when we expose whether vr^ is in D^, we 
again fail only with probability at most = o(-^). Finally, when we expose the color of the 

new edge, we know that it will be in C3, so the probability that it is a previously used color is at 
most 

, , 1 2n 1 2 /I' 



Therefore, the probability that all of the > log n — 3 Ds-out-edges of u fail is at most 

-)*-^(£)\.^..*,(AV„- 

where we used the definition of L in ([6|) for the final bound. A similar calculation works for v, 
and for the separate case when we incorporate a new vertex into the intervals. Therefore, taking a 
union bound over the o(n) iterations in linking vertices and paths, we conclude that our procedure 
completes successfully whp. □ 



4.5.3 Step 3 

Step 1 established that whp, the initial and final vertices of the long path P are both i?i-good, so 
the original system of segments can be arranged as a single block, with successive intervals linked 
by edges of P. We now prove by induction that after the absorption of each path or vertex in Step 
2, the collection of segments can be re-partitioned into blocks of segments, such that within each 
block, consecutive segments have their endpoints linked via P, and the initial and final endpoints 
in each block are of type-A, and i?i-good. 

There are two cases, depending on whether the absorption involves two segments in the same 
block (as in Figure [2]), or in different blocks (as in Figure [3]). If the segments are in the same block, 
then we can easily divide that block into two blocks satisfying the condition. Indeed, in Figure O 
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one of the new blocks is the string of segments between the vertices indicated by A2 in the diagram, 
and the other new block is the complement. This works because within each of the two new blocks, 
every edge between successive segments was an edge between successive segments of the original 
block, hence in P. Also, of the four initial/final endpoints among the two new blocks, two of them 
were the initial/final endpoints of the original block, and the other two were identified as i?i-good 
vertices, now in A2. Therefore, the new block partition satisfies the requirements. 

On the other hand, if the absorption involves two segments from different blocks, then one can 
re-partition the two blocks into three new blocks, as illustrated in Figure [3j A similar analysis to 
above then completes the argument. 



4.5.4 Step 4 

We have not yet revealed anything about the Di-out-neighbors of any vertices in B2 = Bi; the 
only thing we know is that they had d'^ > logn. For each vertex b G Bi, let us now expose 
the numeric value of df{b;A2), but again, not precisely where the out-endpoints are. Since our 
absorption procedure maintained \A2\ = \Ai\ = (1 — o(l))£, the same argument that we used for 
Step 1 now establishes Step 4. 



4.5.5 Step 5 



By Steps 1 and 4, the total number of merges which occur in Step 5 is at most 0{n ■ e~^^°^"'). 
Since B3 C B2 = Bi and A3 C A2, for every a G A3 we can independently sample d^(a; i?3) using 
only the value of d'l{a]Bi). Indeed, since a G ^3 was i?i-good, it had d'^{a;Bi) > l^^logn. 
We will only have df{a;B^) < J^g^logre if at least |^logn — ^^logn = ^^jlogn of those 
out-neighbors land in Bi \ B^ as opposed to B3. Let 



\B1\B3 
\B,\ 



O 



n • e 



-Vlogr 



n/L 



Note that 



Bin 



180L 



logn, q > 



1800L 



log n 



< 



£1^1 

180L 



logn 



< (Weq) 



■ log n 



-(l-o(l))Vlogn-. 



g \- "\^// V ■•■ 1800L 
< g-f^((log")'''^^/loglog" 

= o(n-i). 



log n 



2001 logn. 
because we had A3 C A2, 



Therefore, a final union bound establishes that whp, every a G A3 has df{a] B^) > 

A similar argument establishes the bound for (i^(6; A3), for b £ B3, 
and had only previously exposed the value of df{b;A2). The last claim in Step 5 is clear from our 
order of exposure. 
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4.6 Proof of Lemma 13.81 



In this final stage of tlie proof, we use Gi to link together the endpoints of the system of segments 
Ii,...,Ir created by Lemma [3.51 As described in the overview (Section IS.Sp . we construct an 
auxiliary directed graph F. Importantly, no DJ-edges have been revealed between the endpoint 
sets A and -B, so we may now specify a model for the random r- vertex digraph F. 

Indeed, consider a vertex Wfc £ T, 1 < /c < r, and let a, b be the A- and i?-endpoints of the 
corresponding interval Ik- We first generate a set Ei of F-edges by sending exactly df{b; A) directed 
edges out of Wk, and exactly df{a; B) directed edges in to w^- This is analogous to the d-in, d-out 
model, except that not all degrees are equal. Some directed edges will be generated twice; let Fi 
be that subset, but keep only one copy in Ei. Color every edge of Ei independently from Ci. 
Finally, generate a random subset F2 C Ei\ Fi by independently sampling each edge of Ei \ Fi 
with probability ^pi ■ (jqrg^rf^^qrg^) • Let Ei \ F2 be the final edge set of F. 

The reason for the removal of F2 is that some of the initially-generated edges of F will find 

conflicts once D? is generated. Indeed, every edge uuwt that we have placed in F corresponds to 
_^ i _^ 

an edge ba G Di or ab G Di (or both), for some b ^ B, a & A. When both do not occur, and only 
ba G Di (say), then we need to expose whether G \ Di; if it is in \ Di, it removes Wjwl 
from F with probability 1/2. 

To simplify notation, let 6~^{'Wk) and d~{'Wk) be the numbers of out- and in-edges that are 
generated at Wk to build the initial edge set Ei. They correspond to df{b; A) and df{a; B) above, 
and have therefore been revealed by our previous exposures. Importantly, we have the bounds 

log n < 5^ (wk) < 5 log n. 
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It is more convenient to restrict our attention to a smaller subset E2 C Ei which is itself already 
rainbow; then, every ordinary directed Hamilton cycle will automatically be rainbow. For this, we 
expose at every vertex Wk what the colors of the 6~^{wk) out-edges and 6~{wk) in-edges will be, 
but not their locations. 

Lemma 4.11. Suppose that 6^{wk) > loBT^'^S'^ f'^''^ all 1 < k < r. Then whp, it is possible to 
select 3 out-edges and 3 in-edges from each so that all 6r selected colors are distinct. 

Proof. Construct an auxiliary bipartite graph H with vertex partition W L) Ci, where W = 
{wf ,Wi , . . . , ,w~}. Place an edge between w'^ and c if one of w^s 6~^{wk) out-edges has color 
c. Edges between and c are defined with respect to w^s in-edge colors. The desired result is a 
perfect l-to-3 matching in H. For this, we apply the l-to-3 version of Hall's theorem: we must show 
that for every X C W, we have |A^(X)| > 3\X\, where N{X) is the union of the if-neighborhoods 
of all vertices of X. 

This follows from a standard union bound. Indeed, fix an integer 1 < x < 2r, and consider an 
arbitrary pair of subsets X G W and Y C C, with |X| = x and |y| = 3x. The probability that 
N{X) CY {in H) is at most 

3x 

The innermost term is the probability that a random color from Ci is in Y . The exponents come 
from the fact that each vertex oi Wk & X samples at least 5^{wk) > ^^logn colors for its out- 
and in-neighbors. 
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Multiplying this bound by the number of ways there are to select X and Y ^ and using r < J, 
we find that the probability of failure for a fixed x is at most 



•iin 
3x 



3X \ 200L 

9^ 



^^-^ log n 



< 



rv \ X / a \ 3a; 

zer \ / oin 
2>x 



/2en 



yin\3 / 3x 



3x 



9in 



We will sum this over all 1 < x < 2r. The outer exponent allows us to bound this by a decreasing 
geometric series, so it suffices to show that the interior of the square bracket is uniformly o(l) for 
all 1 < X < 2r. Indeed, observe that the exponent of x inside the bracket is logn — 4 > 0, so 



it is maximized at x 



2r < ^. Yet 



/ 2en W 9in \ V3(2n/L) \ ^fer 
\L{2n/L) J \3{2n/L) J \ Bin J 



logn 



6*1 L 



200L 



log n— 3 



Since ^ ensures that L > J^, we have < | . Yet the exponent log n — 3 tends to infinity 
as n grows, so we indeed obtain a uniform upper bound of o(l) for all 1 < x < 2r. Therefore, whp, 
every subset X dW has |A^(X)| > 3\X\, and the l-to-3 version of Hall's theorem establishes the 
desired result. □ 

Recall from the beginning of this section that the final edge set of F is i?i \ F2 . Let E2 be the 
set of 6r edges selected by Lemma 14.111 This corresponds to a copy of -D3-in,3-out • Unfortunately, 
in our model we still need to expose the locations of the remaining 5^{wk) — 3 remaining in- and 
out-edges at every vertex Wk- It is possible that an edge of E2 may be generated again in this 
stage. That edge would then have 1/2 probability of receiving the color of the new copy, which 
would not be in our specially constructed rainbow set. To account for this, let be the set of 
edges generated by exposing these remaining in- and out-edges, so that E2U E^ = Ei. It suffices 
to find a directed Hamilton cycle in E2 \ {E^ U i*2). 

It is not convenient to work directly with E3 or F2, because they depend on the result of £'2- Let 
F3 be the random directed graph Dr^q defined by sampling each edge with probability q = lMLi2£Ii^ 
and generated independently of £'2. Fortunately, we can control F3 instead. 

Lemma 4.12. There is a coupling of the probability space such that E^L) F2 C F3 whp. 

Proof. The set F2 is a random subset of Ei \ Fi obtained by independently sampling each edge 
with a probability of at most , so it is clearly contained in a copy of D , logn that is generated 
independently of E2. Next, although £3 is exposed after E2, it is still contained in a copy of 



D 



(6 log n)-in, (6 log n)-out 



that is generated independently of E2. Indeed, at a vertex Wk, we generate 



£3 by exposing S^{wk) — 3 new out- and in-edges, but 6^{wk) < 5 logn. 

We observe a standard coupling which realizes -D(6iogra)-in,(6iogn)-out as a subgraph of 120 log n 

' r 

whp. For this, consider the following system for generating a random directed graph. For every 
ordered pair of vertices {u,v), generate two independent Bernoulli random variables I^^ and 
each with probability parameter " . Create the directed edge ui, if and only if at least one of 



23 



or I^^^ took the value 1. This is clearly contained in Z?r,g- For each vertex u, let D^{u) be the 
number of other vertices v for which = 1, and let D~{u) be the number of = 1. These 
are all distributed as Bin — 1, ^oiog" ^ _ Lemma l4.ll establishes that for fixed u, the probability 
that D^(u) < 61ogn is at most g-0-533-(60-o{i))iogn ^ ^^^^ similarly for D^{u). Therefore, a union 
bound establishes that whp, all D^(n), D~{u) > 61ogn. Conditioning on the values of D~^{u) and 
D^{u), we see that when the indicators are revealed, this indeed contains i^{6iogn)-in,(6iogn)-out- 
The result then follows by recalling that r = (1 — o(l))^, and taking the union of D , 120L log n with 
another independent D logn to cover F2. □ 



We are now ready to finish the final lemma in our proof of Theorem 11.11 

Proof of Lemma 13.31 We have established that it suffices to find an ordinary directed Hamilton 
cycle in F3, without regard to color. Conveniently, F3 is now independent of E2 ~ -Ds-in.s-out • 
This independence allows us to first expose how many of the 3-in, 3-out edges at each vertex will 
be in F^, and then only expose the locations of those that are not in F^. At each vertex n, the 
probability that more than one of the 6 in- or out-edges is in F^ is at most 

6\ /130Llogn\^ , _i, 
' = o(n ), 



2/ V n 

so a union bound establishes that whp, E2 \ F^ contains a copy of -D2-m,2-out- This is known to be 
Hamiltonian whp by Theorem 13.71 from Section [3.3^ so our proof is complete. □ 



5 Concluding remarks 

Our main contribution, part (b) of Theorem 11.11 sharpens the earlier result of Cooper and Frieze 
[8] to achieve optimal first-order asymptotics. As we mentioned in the introduction, we suspect 
that our result can be further sharpened within the o(l) term. We do not push to optimize our 
bounds on e, 9 because it is not clear that incremental improvements upon our current approach 
will be sufficiently interesting. Instead, we would be more interested in determining whether one 
can extend the celebrated "hitting time" result of Bollobas @] , which states that one can typically 
find a Hamilton cycle in the random graph process as soon as the minimum degree reaches two. 

Question 5.1. Consider the edge-colored random graph process in which ei, 62, . . . , ejy, N = (2) is 
a random permutation of the edges of Kn- The graph Gm is defined as ([n],{ei,e2, . . . ,em})- Let 
each edge receive a random color from a set C of size at least n. Then whp, the first time that a 
rainbow Hamilton cycle appears is precisely the same as the first time that the minimum degree of 
Gm is at least two and at least n colors have appeared. 

Although this may be out of reach at the moment, another natural challenge is to settle the 
problem in either the case when the number of edges is just sufficient for an ordinary Hamilton 
cycle, or in the case when the number of colors is, say, exactly n. Part (a) of our theorem answered 
the latter question when n was even. We believe that it is probably also true when n is odd. It 
would be nice to prove that extension, either directly or by reducing the divisibility condition in 
Theorem 12. 1[ 
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